Search CORE

53 research outputs found

Pattern matching and pattern discovery algorithms for protein topologies

Author: C. Bron
C.A. Orengo
C.A. Orengo
D. Gilbert
D.R. Westhead
D.R. Westhead
H.M. Berman
I. Koch
J.J. McGregor
J.R. Ullmann
K. Hofmann
K. Zhang
L. Holm
P.A. Evans
T.P.J. Flores
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2001
Field of study

We describe algorithms for pattern matching and pattern learning in TOPS diagrams (formal descriptions of protein topologies). These problems can be reduced to checking for subgraph isomorphism and finding maximal common subgraphs in a restricted class of ordered graphs. We have developed a subgraph isomorphism algorithm for ordered graphs, which performs well on the given set of data. The maximal common subgraph problem then is solved by repeated subgraph extension and checking for isomorphisms. Despite the apparent inefficiency such approach gives an algorithm with time complexity proportional to the number of graphs in the input set and is still practical on the given set of data. As a result we obtain fast methods which can be used for building a database of protein topological motifs, and for the comparison of a given protein of known secondary structure against a motif database

Crossref

Brunel University Research Archive

Consensus clustering and functional interpretation of gene-expression data

Author: Kellam P.
Liu X.
Martin Nigel
Orengo C.A.
Swift S.
Tucker A.
Vinciotti V.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Microarray analysis using clustering algorithms can suffer from lack of inter-method consistency in assigning related gene-expression profiles to clusters. Obtaining a consensus set of clusters from a number of clustering methods should improve confidence in gene-expression analysis. Here we introduce consensus clustering, which provides such an advantage. When coupled with a statistically based gene functional analysis, our method allowed the identification of novel genes regulated by NFκB and the unfolded protein response in certain B-cell lymphomas

Springer - Publisher Connector

UCL Discovery

PubMed Central

Birkbeck Institutional Research Online

Spiral - Imperial College Digital Repository

Brunel University Research Archive

Stability-activity tradeoffs constrain the adaptive evolution of RubisCO

Author: Christin P-A.
Orengo C.A.
Studer R.A.
Williams M.A.
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 27/01/2014
Field of study

A well-known case of evolutionary adaptation is that of ribulose-1,5-bisphosphate carboxylase (RubisCO), the enzyme responsible for fixation of CO2 during photosynthesis. Although the majority of plants use the ancestral C3 photosynthetic pathway, many flowering plants have evolved a derived pathway named C4 photosynthesis. The latter concentrates CO2, and C4 RubisCOs consequently have lower specificity for, and faster turnover of, CO2. The C4 forms result from convergent evolution in multiple clades, with substitutions at a small number of sites under positive selection. To understand the physical constraints on these evolutionary changes, we reconstructed in silico ancestral sequences and 3D structures of RubisCO from a large group of related C3 and C4 species. We were able to precisely track their past evolutionary trajectories, identify mutations on each branch of the phylogeny, and evaluate their stability effect. We show that RubisCO evolution has been constrained by stability-activity tradeoffs similar in character to those previously identified in laboratory-based experiments. The C4 properties require a subset of several ancestral destabilizing mutations, which from their location in the structure are inferred to mainly be involved in enhancing conformational flexibility of the open-closed transition in the catalytic cycle. These mutations are near, but not in, the active site or at intersubunit interfaces. The C3 to C4 transition is preceded by a sustained period in which stability of the enzyme is increased, creating the capacity to accept the functionally necessary destabilizing mutations, and is immediately followed by compensatory mutations that restore global stability

PubMed Central

Birkbeck Institutional Research Online

White Rose Research Online

ISPIDER Central: an integrated database web-server for proteomics

Author: Belhajjame K.
Côté R.
Embury S.M.
Goble C.A.
Hermjakob H.
Hubbard S.J.
Jones D.T.
Jones P.
Martin Nigel
Oliver S.G.
Orengo C.A.
Paton N.W.
Pentony M.M.
Poulovassilis Alexandra
Selley J.N.
Siepen J.A.
Stevens R.
Zamboulis Lucas
Publication venue: 'Oxford University Press (OUP)'
Publication date: 25/04/2008
Field of study

Despite the growing volumes of proteomic data, integration of the underlying results remains problematic owing to differences in formats, data captured, protein accessions and services available from the individual repositories. To address this, we present the ISPIDER Central Proteomic Database search (http://www.ispider.manchester.ac.uk/cgi-bin/ProteomicSearch.pl), an integration service offering novel search capabilities over leading, mature, proteomic repositories including PRoteomics IDEntifications database (PRIDE), PepSeeker, PeptideAtlas and the Global Proteome Machine. It enables users to search for proteins and peptides that have been characterised in mass spectrometry-based proteomics experiments from different groups, stored in different databases, and view the collated results with specialist viewers/clients. In order to overcome limitations imposed by the great variability in protein accessions used by individual laboratories, the European Bioinformatics Institute's Protein Identifier Cross-Reference (PICR) service is used to resolve accessions from different sequence repositories. Custom-built clients allow users to view peptide/protein identifications in different contexts from multiple experiments and repositories, as well as integration with the Dasty2 client supporting any annotations available from Distributed Annotation System servers. Further information on the protein hits may also be added via external web services able to take a protein as input. This web server offers the first truly integrated access to proteomics repositories and provides a unique service to biologists interested in mass spectrometry-based proteomics

PubMed Central

Birkbeck Institutional Research Online

The Australian National University

The University of Manchester - Institutional Repository

Computation of protein geometry and its applications: Packing and function prediction

Author: A. Bondi
A. Goede
A.C. Wallace
A.C.R. Martin
A.E. Todd
B. Lee
B.J. Gellatly
C. Hu
C.A. Orengo
D. Fischer
F. Glaser
F.M. Richards
F.M. Richards
F.M. Richards
F.M. Richards
F.M. Richards
G. Rhodes
G.M. Crippen
H. Edelsbrunner
H. Edelsbrunner
H. Edelsbrunner
H. Edelsbrunner
H. Edelsbrunner
H. Edelsbrunner
J. Liang
J. Liang
J. Liang
J. Liang
J. Tsai
J. Word
J. Zhang
J.L. Finney
K.W. Kratky
L. Guibas
L. Holm
M. Levitt
M. Petitjean
M.L. Connolly
O. Lichtarge
P. Røgen
P. Røgen
P.J. Artymiuk
R. Bader
R. Norel
R. Russell
R.A. Laskowski
R.K. Singh
S. Chakravarty
T. Binkowski
T. Binkowski
T.A. Binkowski
T.A. Binkowski
T.J. Richmond
W. Rieping
W. Zheng
X. Li
X. Li
Y. Harpaz
Y. Tseng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/01/2006
Field of study

This chapter discusses geometric models of biomolecules and geometric constructs, including the union of ball model, the weigthed Voronoi diagram, the weighted Delaunay triangulation, and the alpha shapes. These geometric constructs enable fast and analytical computaton of shapes of biomoleculres (including features such as voids and pockets) and metric properties (such as area and volume). The algorithms of Delaunay triangulation, computation of voids and pockets, as well volume/area computation are also described. In addition, applications in packing analysis of protein structures and protein function prediction are also discussed.Comment: 32 pages, 9 figure

arXiv.org e-Print Archive

Crossref

Pattern matching and pattern discovery algorithms for protein topologies

Author: H.M. Berman
C. Bron
P.A. Evans
T.P.J. Flores
D. Gilbert
K. Hofmann
L. Holm
I. Koch
J.J. McGregor
C.A. Orengo
C.A. Orengo
J.R. Ullmann
D.R. Westhead
D.R. Westhead
K. Zhang
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2001
Field of study

arXiv.org e-Print Archive

DESY Publication Database

Crossref

DESY

CERN Document Server

Brunel University Research Archive

Sequences and topology. Genes and structures in context

Author: Bork P.
Orengo C.A.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

MDC Repository

DOMPLOT: a program to generate schematic diagrams of the structural domain organization within proteins, annotated by ligand contacts

Author: A.E. Todd
C.A. Orengo
J.M. Thornton
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

PFDB: a generic protein family database integrating the CATH domain structure database with sequence based protein family resources

Author: Johnson Roger
Kellam P.
Martin Nigel
Orengo C.A.
Shepherd Adrian J.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/12/2002
Field of study

Motivation: The PFDB (Protein Family Database) is a new database designed to integrate protein family-related data with relevant functional and genomic data. It currently manages biological data for three projects—the CATH protein domain database (Orengo et al., 1997; Pearl et al., 2001), the VIDA virus domains database (Albà et al., 2001) and the Gene3D database (Buchan et al., 2001). The PFDB has been designed to accommodate protein families identified by a variety of sequence based or structure based protocols and provides a generic resource for biological research by enabling mapping between different protein families and diverse biochemical and genetic data, including complete genomes. Results: A characteristic feature of the PFDB is that it has a number of meta-level entities (for example aggregation, collection and inclusion) represented as base tables in the final design. The explicit representation of relationships at the meta-level has a number of advantages, including flexibility—both in terms of the range of queries that can be formulated and the ability to integrate new biological entities within the existing design. A potential drawback with this approach—poor performance caused by the number of joins across meta-level tables—is avoided by implementing the PFDB with materialized views using the mature relational database technology of Oracle 8i. The resultant database is both fast and flexible. This paper presents the principles on which the database has been designed and implemented, and describes the current status of the database and query facilities supported

Crossref

Birkbeck Institutional Research Online

The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues

Author: A.E. Todd
C.A. Orengo
F.M.G. Pearl
J.E. Bray
J.M. Thornton
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref